Parsing Free Word-Order Languages in Polynomial Time

نویسندگان

  • Tilman Becker
  • Owen Rambow
چکیده

Long-Distance Scrambling is a word-order phenomenon which is “doubly unbounded” in that (i) more than one element can move, and (ii) movement can be unbounded. In (Becker et al., 1991), we argue that scrambling is beyond TAG by assuming that elementary trees express a complete predicate-argument structure. In (Becker et al., 1992), we show that no formalism in the class LCFRS (which includes TAG) can derive scrambling. (Becker et al., 1991) proposes two variants of the TAG formalism which can derive scrambling while still preserving most of the desirable properties of TAGs (i.e., an extended domain of locality and the factoring of recursion). However, little is known about the formal and computational properties of those systems. (Rambow, 1994) proposes V-TAG, which is closely related to one of the previously proposed varaiants, but redefines the derivation relation. V-TAG can derive the relevant set of sentences and also cases where scrambling co-occurs with long-distance topicalization (a separate linguistic phenomenon also found in English, in which a single element moves into sentenceinitial position):

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تأثیر ساخت‌واژه‌ها در تجزیه وابستگی زبان فارسی

Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...

متن کامل

Diachronic Trends in Word Order Freedom and Dependency Length in Dependency-Annotated Corpora of Latin and Ancient Greek

One easily observable aspect of language variation is the order of words. In human and machine natural language processing, it is often claimed that parsing freeorder languages is more difficult than parsing fixed-order languages. In this study on Latin and Ancient Greek, two wellknown and well-documented free-order languages, we propose syntactic correlates of word order freedom. We apply our ...

متن کامل

Developing a Minimalist Parser for Free Word Order Languages with Discontinuous Constituency

We propose a parser based on ideas from the Minimalist Programme. The parser supports free word order languages and simulates a human listener who necessarily begins sentence analysis before all the words in the sentence have become available. We first sketch the problems that free word order languages pose. Next we discuss an existing framework for minimalist parsing, and show how it is diffic...

متن کامل

Robust and efficient semantic parsing of free word order languages in spoken dialogue systems

This paper presents a semantic parser for spoken dialogue systems. The parser is designed especially for the analysis of free word order languages by providing a feature called orderindependent matching. We describe how this feature allows writing of rules for free word order languages in an elegant way (using German as example language) and how it increases the robustness against speech recogn...

متن کامل

Partially Ordered Multiset Context-free Grammars and Free-word-order Parsing

We present a new formalism, partially ordered multiset context-free grammars (pomsCFG), along with an Earley-style parsing algorithm. The formalism, which can be thought of as a generalization of context-free grammars with partially ordered right-hand sides, is of interest in its own right, and also as infrastructure for obtaining tighter complexity bounds for more expressive context-free forma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/cmp-lg/9411008  شماره 

صفحات  -

تاریخ انتشار 1994